
LEONARD , JOHN E . 
NEWMAN, ROLAND A. 
REFF, MITCHELL E. 
RASTETTER, WILLIAM H. 



(ii) TITLE OF INVENTION: THERAPEUTIC APPLICATION OF CHIMERIC AND 
RADIOLABELED ANTIBODIES TO HUMAN B LYMPHOCYTE RESTRICTED 
DIFFERENTIATION ANTIGEN FOR THE TREATMENT OF B CELL 
LYMPHOMA 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PILLSBURY WINTHROP 

(B) STREET: 1100 New York Avenue, N.W. , Ninth FL. 

( C ) CITY : Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP : 20005 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/921,060 

(B) FILING DATE: 29-AUG-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/149,099 

(B) FILING DATE: 03 -NOV- 1993 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/978,891 

(B) FILING DATE: 13 -NOV- 1992 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Teskin, Robin L. 

(B) REGISTRATION NUMBER: 3 5,03 0 

(C) REFERENCE/DOCKET NUMBER: 037003-0275463 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-861-3000 

(B) TELEFAX: 202-822-0944 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GGGAGCTTGG ATCGATCCTC TATGGTT 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 
AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 
GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 
ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 24 0 
GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 
GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360 
AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 
GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 
ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 54 0 
TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 84 0 
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 
CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 
CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 1020 
CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAAT C A AACGTACGGT GGCTGCACCA 1080 
TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 
TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 12 00 
CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1260 
AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 1320 
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 13 80 
TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 1440 
GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1500 
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1560 
TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1620 
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1680 



GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 1740 
ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 1800 
TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 1860 
GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 1920 
CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 198 0 
GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2 040 
TAAGCAGAGC TGGGTACGTC CT C AC ATT C A GTGAT CAGCA CTGAACACAG ACCCGTCGAC 2100 
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CGCTAGCACC 2160 
AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 222 0 
GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 2280 
GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGG ACT CTAC 2 34 0 
TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 24 00 
AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGCAGAGCC CAAATCTTGT 2460 
GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 252 0 
TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 2580 
TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 2 64 0 
GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 2700 
CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGACTACAAG 2760 
TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 2 82 0 
GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGAC C AGG 2880 
AACCAGGTCA GCCTGACCTG CCTGGT CAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 2 940 
TGGG AG AG C A ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 3 000 
GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 3 06 0 
AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 312 0 
CTCTCCCTGT CTCCGGGTAA ATGAGGATCC GTTAACGGTT ACCAACTACC TAGACTGGAT 318 0 
TCGTGACAAC ATGCGGCCGT GATATCTACG TATGAT CAGC CTCGACTGTG CCTTCTAGTT 324 0 
GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 33 00 
CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3 3 60 
CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA 342 0 
GGCATGCTGG GGATGCGGTG GGCTCTATGG AACCAGCTGG GGCTCGACAG CGCTGGATCT 34 80 
CCCGATCCCC AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 354 0 
AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 3600 
AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 3 66 0 
GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CAT C AC CGAA 372 0 
GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 37 8 0 
GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 3 84 0 
CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC 3 900 
CAAACTTGAC GGCAATCCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG 3 960 
GTTCGAC CAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 4 020 
GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC 4 080 
TCTTCAGTGG AAGGTAAACA GAATCTGGTG ATTATGGGTA GGAAAAC CTG GTTCTCCATT 414 0 
CCTGAGAACA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA 42 00 
GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT 42 60 
GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTCTGTT 432 0 
TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG 43 8 0 
GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 444 0 
GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCAT CAAGTA TAAGTTTGAA 4500 
GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 4560 
TCATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGC CT CGACTGTGCC 4 620 
TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 468 0 
TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 474 0 
GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4800 
CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4 860 
TAGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4 92 0 
ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4 980 
TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 504 0 
GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCAC CGA AGCCTGATTC 5100 
CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 
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AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 522 0 
TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 5280 
TCCGGCCGCT TGGGTGGAGA GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG 534 0 
CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 54 00 
CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 546 0 
CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 552 0 
GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 55 8 0 
GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 5 64 0 
CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGC CGG 5700 
TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 576 0 
CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC 582 0 
CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 5880 
GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA 594 0 
GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTCCCGATTC 6000 
GCAGCGCATC GCCTTCTATC GCCTTCTTGA CGAGTTCTTC TGAGCGGGAC TCTGGGGTTC 606 0 
GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG ATTTCGATTC CACCGCCGCC 612 0 
TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG 6180 
CGCGGGGATC TCATGCTGGA GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT 624 0 
GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 63 00 
TCTAGTTGTG GTTTGTCCAA ACTCATCAAT CTATCTTATC AT GT CTGGAT CGCGGCCGCG 63 60 
ATCCCGTCGA GAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC 6420 
CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 64 80 
AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA 6 54 0 
ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 6600 
TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 6 660 
GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 672 0 
CAGGAAAGAA CATGTGAGCA AAAGGC CAGC AAAAGGCCAG GAAC CGTAAA AAGGCCGCGT 6 780 
TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA 6 84 0 
GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAG AT AC C A GGCGTTTCCC CCTGGAAGCT 6 900 
CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTAC CGG ATACCTGTCC GCCTTTCTCC 6960 
CTT CGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 7 020 
TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 7 080 
TATC CGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 714 0 
CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA 72 00 
AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 7260 
AGCCAGTTAC CTT CGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 7 320 
GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 73 80 
AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 744 0 
GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT 7 5 00 
GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 7560 
TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 7 62 0 
TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 7680 
TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG 774 0 
GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 7 800 
GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 7860 
TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 7 920 
CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7980 
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 804 0 
CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 8100 
AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 8160 
CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA 8220 
AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTT CGATGT 82 80 
AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT 834 0 
GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 84 00 
GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA 84 6 0 
TG AG CGGAT A CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 852 0 
TTCCCCGAAA AGTGCCACCT 854 0 
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(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 09 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 12 0 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 3 00 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 36 0 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 72 0 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT ACCGGTTTGA CTCACGCGGA 780 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 96 0 

CATCACAGAT CTCTCACTAT GGATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT 1020 

GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT 1080 

GCATCTCCAG GGGAGAAGGT CACAATGACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC 114 0 

CACTGGTTCC AGCAGAAGCC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC 1200 

CTGGCTTCTG GAGTCCCTGT TCGCTTCAGT GGCAGTGGGT CTGGGACTTC TTACTCTCTC 1260 

ACAATCAGCA GAGTGGAGGC TGAAGATGCT GCCACTTATT ACTGCCAGCA GTGGACTAGT 1320 

AACCCACCCA CGTTCGGAGG GGGGACCAAG CTGGAAATCA AACGTACGGT GGCTGCACCA 1380 

TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1440 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1500 

CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1560 

AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 1620 

TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1680 

TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 174 0 

GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1800 

GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC I86 0 

TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1920 

GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1980 

GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 2 040 

ACGT C AATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 2100 

TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT T AC C ATGGTG ATGCGGTTTT 2160 

GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 222 0 

CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 2280 

GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 234 0 

TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 24 00 

ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CCTGTCCCAG 2460 

GTACAACTGC AGCAGCCTGG GGCTGAGCTG GTGAAGCCTG GGGCCTCAGT GAAGATGTCC 2520 

TGCAAGGCTT CTGGCTACAC ATTTACCAGT TACAATATGC ACTGGGTAAA ACAGACACCT 2 580 
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2640 
2700 
2760 
2820 
2880 
2940 



3120 
3180 
3240 



3480 
3540 
3600 
3660 



GGTCGGGGCC TGGAATGGAT TGGAGCTATT TATCCCGGAA ATGGTGATAC TTCCTACAAT 
CAGAAGTTCA AAGGCAAGGC CACATTGACT GCAGACAAAT CCTCCAGCAC AGC CTACATG 
CAGCTCAGCA GCCTGACATC TGAGGACTCT GCGGTCTATT ACTGTGCAAG ATCGACTTAC 
TACGGCGGTG ACTGGTACTT CAATGTCTGG GGCGCAGGGA CCACGGTCAC CGTCTCTGCA 
GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 

TCgSSg GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 3000 

SaCTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 3060 

TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGCAGAGCCC "™ 
AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG CACCTGAACT CCTGGGGGGA 
CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 

GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 3300 

TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC 3360 

AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 342 0 

gISSgt gcaaggtctc caacaaagcc ctcccagccc ccatcgagaa aaccatctcc — 

AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC TGCCCCCATC CCGGGATGAG 
CTGACCAAGA AC CAGGTC AG CCTGACCTGC CTGGTCAAAG GCTTCTATCC CAGCGACATC 
GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG 

CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG 3720 

CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG CTCTGCACAA CCACTACACG 3780 

CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGAGGATCCG TTAACGGTTA CCAACTACCT 3840 

AgISSaTT CGTGACAACA TGCGGCCGTG AT AT CTACGT ATGATCAGCC TCGACTGTGC 3900 

CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG AC CCTGGAAG 3960 

GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 4020 

GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG 4080 

ACAATAGCAG GCATGCTGGG GATGCGGTGG GCTCTATGGA AC CAGCTGGG GCTCGACAGC 4140 

GCTGGATCTC CCGATCCCCA GCTTTGCTTC TCAATTTCTT ATTTGCATAA TGAGAAAAAA 4200 

AGGAAAATTA ATTTTAACAC CAATTCAGTA GTTGATTGAG CAAATGCGTT GCCAAAAAGG 4260 

ATGCTTTAGA GACAGTGTTC TCTGCACAGA TAAGGACAAA CATTATTCAG AGGGAGTACC 4320 

CAGAGCTGAG ACTCCTAAGC CAGTGAGTGG CACAGCATTC TAGGGAGAAA TATGCTTGTC 4380 

ATCACCGAAG CCTGATTCCG TAGAGC C AC A CCTTGGTAAG GGCCAATCTG CTCACACAGG 4440 

ATAGAGAGGG CAGGAGCCAG GGCAGAGCAT ATAAGGTGAG GTAGGATCAG TTGCTCCTCA 4500 

CATTTGCTTC TGACATAGTT GTGTTGGGAG CTTGGATAGC TTGGACAGCT CAGGGCTGCG 4560 

ATTTCGCGCC AAACTTGACG GCAATCCTAG CGTGAAGGCT GGTAGGATTT TATCCCCGCT 4620 

GCCATCATGG TTCGACCATT GAACTGCATC GTCGCCGTGT C C C AAAATAT GGGGATTGGC 4680 

AAGAACGGAG ACCTACCCTG GCCTCCGCTC AGGAACGAGT TCAAGTACTT CCAAAGAATG 4740 

ACCACAACCT CTTCAGTGGA AGGTAAACAG AATCTGGTGA TTATGGGTAG GAAAACCTGG 4800 

TTCTCCATTC CTGAGAAGAA TCGACCTTTA AAGGACAGAA TTAATATAGT TCTCAGTAGA 4860 

GAACTCAAAG AACCACCACG AGGAGCTCAT TTT CTTGCCA AAAGTTTGGA TGATGCCTTA 4920 

AGACTTATTG AACAACCGGA ATTGGCAAGT AAAGTAGACA TGGTTTGGAT AGTCGGAGGC 4980 
AGTT CTGTTT ACCAGGAAGC CATGAATCAA CCAGGCCACC TTAGACTCTT TGTGACAAGG 504 0 
AT C ATGCAGG AATTTGAAAG TGACACGTTT TTCCCAGAAA TTGATTTGGG GAAATATAAA 5100 
CTTCTCCCAG AATACCCAGG CGTCCTCTCT GAGGTCCAGG AGGAAAAAGG CATCAAGTAT 5160 
AAGTTTGAAG TCTACGAGAA GAAAGACTAA CAGGAAGATG CTTT CAAGTT CTCTGCTCCC 5220 
CTCCTAAAGC TATGCATTTT TATAAGACCA TGGGACTTTT GCTGGCTTTA GATCAGCCTC 5280 
GACTGTGCCT TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC 5340 
CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG CATCGCATTG 5400 
TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA 5460 
TTGGGAAGAC AATAGCAGGC ATGCTGGGGA TGCGGTGGGC TCTATGGAAC CAGCTGGGGC 5520 
TCGAGCTACT AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 5580 
AATTTTAACA C C AATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 5640 
AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 5700 
GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 5760 
GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 582 0 
GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 5880 
CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCTCTATG GTTGAACAAG ATGGATTGCA 5940 
CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG CACAACAGAC 6000 
AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC CGGTTCTTTT 6060 



TPTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG CGCGGCTATC 6120 
SgGCTGGCC aSSSgCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA CTGAAGCGGG 6180 
SgGGaSgG CTgStaTTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT CTCACCTTGC 6240 

tcSgJcSS aaagtatcca TCATGGCTGA TGCAATGCGG cggctgcata cgcttgatcc 
ggSacSSc Sattcgacc accaagcgaa acatcgcatc gagcgagcac gtactcggat 
ggaSccgg? cttgtcgatc aggatgatct ggacgaagag catcaggggc tcgcgccagc 
PCAACTGTTC gccaggctca aggcgcgcat gcccgacggc gaggatctcg tcgtgaccca 
?g55ga?gcc ?gSSccga atatcatggt ggaaaatggc cgcttttctg gattcatcga 
SgtgSSSg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta cccgtgatat 
tgSgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg gtatcoccgc 

ESSESS CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT GAGCGGGACT 
PTPGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA TTTCGATTCC 

otSSaAAG GTTGGGCTTC GGAATCGTTT TCCGGGACGC CGGCTGGATG 6840 
ATcScSS SSSLttCT CATGCTGGAG TTCTTCGCCC ACCCCAACTT GTTTATTGCA 6900 
GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT 

tSSgStt Sagttgtgg tttgtccaaa ctcatcaatc tatcttatca tgtctggatc 

GcSgcSSgA TCCC6TCGAG AGCTTGGCGT AATCATGGT C ATAGCTGTTT CCTGTGTGAA 
attottatS gctcacaatt CCACACAACA TACGAGCCGG AAGCATAAAG TGTAAAGCCT 
SggSSa aSStGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC 7200 
SSSE CCTGTCGTGC CAGCTG^TT AATGAATCGG C^CGCGCO GGGAGAGG^ 7260 

7380 
7440 
7500 
7560 



6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 



StSgStIt tgggcgctct tccgctt^ct Sctcactga ctcgctgcgc tcggtcgttc 
ggSgcScg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 
gggataaJS aggaaagaac -atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 
agScXgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc 
SacgScaag tcSSSgg cgaaacccga caggactata aagataccag gcgtttcccc 

SgGaIgSc CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG 7620 
CcS^Sc TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG TATCTCAGTT 7680 

cggSSggt CGTTCGCTCC aagctgggct gtgtgcacga accccccgtt cagcccgacc 
gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 

ScTGGcS ScCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG 7860 
SSSSXa OTGotScCT AACTACGGCT ACACTAGAAG GACAGTATTT GGTATCTGCG 7920 

JtSgSgaI gcSag?tacc TTCGGAAAAA GAGTTGGTAG ctcttgatcc ggcaaacaaa 
cSSgctS SSggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 
gatc?caaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact 
ScSSaS gIStttggtc atgagattat caaaaaggat cttcacctag atccttttaa 

ATTAAAAATG aStTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT 8220 

ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG 8280 

iS^SSS cISStCGTG TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA 8340 

SgCTgSS GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC 8400 

Zc^tc^l SgGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT 8460 

CTaSaATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG 8520 

SSSSSm TGCTACAGGC ATCGTGGTGT cacgctcgtc gtttggtatg gcttcattca 
gSSgStc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg 
ttIgctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca 

ScttItGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG 
tSSgctga gtactcaacc AAGTCATTCT GAGAATAGTG TATGCGGCGA ccgagttgct 
Stccccggc gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca 

TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA 8940 
gScgSg^A AcSaCTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG 9000 
StSSStS AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 

SaaSgtS aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt 

ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC 
CGCGCACATT TCCCCGAAAA GTGCCACCT 

(2) INFORMATION FOR SEQ ID NO : 4 : 



7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 



8520 
8580 
8640 
8700 
8760 
8820 
8880 



9060 
9120 
9180 
9209 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 47 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
ATCACAGATC TCTCACCATG GATTTTCAGG TGCAGATTAT CAGCTTC 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TGCAGCATCC GTACGTTTGA TTTCCAGCTT 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..384 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA 
Met Asp Phe Gin Val Gin He He Ser Phe Leu Leu He Ser Ala Ser 
15 10 15 

GTC ATA ATG TCC AGA GGG CAA ATT GTT CTC TCC CAG TCT CCA GCA ATC 
Val He Met Ser Arg Gly Gin He Val Leu Ser Gin Ser Pro Ala He 
20 25 30 
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336 



384 



CTG TCT GCA TCT CCA GGG GAG AAG GTC ACA ATG ACT TGC AGG GCC AGC 144 
Leu Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Arg Ala Ser 
35 40 45 

TCA AGT GTA AGT TAC ATC CAC TGG TTC CAG CAG AAG CCA GGA TCC TCC 192 
Ser Ser Val Ser Tyr He His Trp Phe Gin Gin Lys Pro Gly Ser Ser 
50 55 SO 

CCC AAA CCC TGG ATT TAT GCC ACA TCC AAC CTG GCT TCT GGA GTC CCT 240 
Pro Lys Pro Trp He Tyr Ala Thr Ser Asn Leu Ala Ser Gly Val Pro 
65 70 75 80 

GTT CGC TTC AGT GGC AGT GGG TCT GGG ACT TCT TAC TCT CTC ACA ATC 288 
Val Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr He 
85 90 95 

AGC AGA GTG GAG GCT GAA GAT GCT GCC ACT TAT TAC TGC CAG CAG TGG 
Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Trp 
100 105 HO 

ACT AGT AAC CCA CCC ACG TTC GGA GGG GGG ACC AAG CTG GAA ATC AAA 
Thr Ser Asn Pro Pro Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Asp Phe Gin Val Gin He He Ser Phe Leu Leu He Ser Ala Ser 
1 5 10 15 

Val He Met Ser Arg Gly Gin He Val Leu Ser Gin Ser Pro Ala He 
20 25 30 

Leu Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Arg Ala Ser 
35 40 45 

Ser Ser Val Ser Tyr He His Trp Phe Gin Gin Lys Pro Gly Ser Ser 
50 55 60 

Pro Lys Pro Trp He Tyr Ala Thr Ser Asn Leu Ala Ser Gly Val Pro 
65 70 75 80 

val Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr He 
85 90 95 

Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Trp 
100 105 HO 



10 



Thr Ser Asn Pro Pro Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
115 120 I 25 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GCGGCTCCCA CGCGTGTCCT GTCCCAG 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GGSTGTTGTG CTAGCTGMRG AGACRGTGA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..420 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT 48 
Met Gly Trp Ser Leu lie Leu Leu Phe Leu Val Ala Val Ala Thr Arg 
15 10 15 

GTC CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG 96 
Val Leu Ser Gin Val Gin Leu Gin Gin Pro Gly Ala Glu Leu Val Lys 
20 25 30 

CCT GGG GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT 144 
Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe 
35 40 45 

ACC AGT TAC AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG 192 
Thr Ser Tyr Asn Met His Trp Val Lys Gin Thr Pro Gly Arg Gly Leu 
50 55 60 

GAA TGG ATT GGA GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT 240 
Glu Trp He Gly Ala He Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn 
65 70 75 80 

CAG AAG TTC AAA GGC AAG GCC ACA TTG ACT GGA GAC AAA TCC TCC AGC 288 
Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser 
85 90 95 

ACA GCC TAC ATG CAG CTC AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC 33 6 

Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 
100 105 HO 

TAT TAC TGT GCA AGA TCG ACT TAC TAC GGC GGT GAC TGG TAC TTC AAT 384 
Tvr Tyr Cys Ala Arg Ser Thr Tyr Tyr Gly Gly Asp Trp Tyr Phe Asn 
115 120 125 



GTC TGG GGC GCA GGG ACC ACG GTC ACC GTC TCT GCA 
Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ala 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Met Gly Trp Ser Leu He Leu Leu Phe Leu Val Ala Val Ala Thr Arg 
1 5 10 I 5 

Val Leu Ser Gin Val Gin Leu Gin Gin Pro Gly Ala Glu Leu Val Lys 

25 30 



420 



Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe 
35 40 45 
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Thr Ser Tyr Asn Met His Trp Val Lys Gin Thr Pro Gly Arg Gly Leu 
50 55 60 

Glu Trp lie Gly Ala He Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn 
65 70 75 80 

Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser 
85 90 95 

Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 
100 105 11° 

Tyr Tyr Cys Ala Arg Ser Thr Tyr Tyr Gly Gly Asp Trp Tyr Phe Asn 
115 120 I 25 

Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ala 
130 135 140 



